Categories

Summary

Course Overview

In the Big Data course at the British Academy For Training And Development, trainees will learn how big data is driving organizational change and the key challenges organizations face when trying to analyze massive data sets.

Trainees will explore fundamental techniques such as data mining and stream processing. They will also learn to design and implement PageRank algorithms using MapReduce, a programming paradigm that enables massive scalability across hundreds or thousands of servers in a Hadoop cluster. The course will cover how big data has improved web search and how online advertising systems operate.

By the end of this course, participants will gain a better understanding of the various applications of big data methods in industry and research.

Objectives and target group

Who should attend? 

  • Data Analysts
  • Software engineers
  • Programmers

Knowledge and Benefits:

 

After completing the program, participants will be able to master the following:

  • Knowledge and application of MapReduce
  • Understanding the rate of occurrences of events in big data
  • How to design algorithms for stream processing and counting of frequent elements in Big Data
  • Understand and design PageRank algorithms
  • Understand underlying random walk algorithms

Course Content

Course Content

  • The basics of working with big data
    • Understand the four V’s of Big Data (Volume, Velocity, and Variety); Build models for data; Understand the occurrence of rare events in random data.
  • Web and social networks
    • Understand characteristics of the web and social networks; Model social networks; Apply algorithms for community detection in networks.
  • Clustering big data
    • Clustering social networks; Apply hierarchical clustering; Apply k-means clustering.
  • Google web search
    • Understand the concept of PageRank; Implement the basic; PageRank algorithm for strongly connected graphs; Implement PageRank with taxation for graphs that are not strongly connected.
  • Parallel and distributed computing using MapReduce
    • Understand the architecture for massive distributed and parallel computing; Apply MapReduce using Hadoop; Compute PageRank using MapReduce.
  • Computing similar documents in big data
    • Measure importance of words in a collection of documents; Measure similarity of sets and documents; Apply local sensitivity hashing to compute similar documents.
  • Products frequently bought together in stores
    • Understand the importance of frequent item sets; Design association rules; Implement the A-priori algorithm.
  • Movie and music recommendations
    • Understand the differences of recommendation systems; Design content-based recommendation systems; Design collaborative filtering recommendation systems.
  • Google's AdWordsTM System
    • Understand the AdWords System; Analyse online algorithms in terms of competitive ratio; Use online matching to solve the AdWords problem.
  • Mining rapidly arriving data streams
    • Understand types of queries for data streams; Analyse sampling methods for data streams; Count distinct elements in data streams; Filter data streams.

Course Date

2024-12-23

2025-03-24

2025-06-23

2025-09-22

Course Cost

Note / Price varies according to the selected city

Members NO. : 1
£4500 / Member

Members NO. : 2 - 3
£3600 / Member

Members NO. : + 3
£2790 / Member

Related Course

Featured

Internet of Things Training Program

2025-01-27

2025-04-28

2025-07-28

2025-10-27

£4500 £4500

$data['course']